Image processing apparatus, display control method and program

ABSTRACT

Aspects of the present invention include an apparatus comprising a recognition unit configured to recognize real object in an image. The apparatus may further comprise a determining unit configured to determine a stability indicator indicating a stability of the recognition, and a display control unit configured to modify a display of a virtual object according to the stability indicator.

CROSS REFERENCE TO RELATED APPLICATION

The present application is a continuation application of U.S. patentapplication Ser. No. 15/015,926, filed Feb. 4, 2016, and is acontinuation of U.S. patent application Ser. No. 14/008,789, filed Apr.2, 2012, which is a National Stage of PCT/JP2012/002269, filed Apr. 2,2012, and claims the priority from prior Japanese Priority PatentApplication JP 2011-086684 filed in the Japan Patent Office on Apr. 8,2011, the entire content of which is hereby incorporated by reference.

TECHNICAL FIELD

The present disclosure relates to an image processing apparatus, adisplay control method, and a program.

BACKGROUND ART

In recent years, a technique called augmented reality (AR), whichpresents a user with additional information on the real world in asuperimposed manner, has been attracting attention. In the AR technique,presentations of information to the user are sometimes called“annotations,” and can be visualized using virtual objects of variousforms such as text, icons, or animations. In general AR applications, avirtual object is arranged to be aligned to a position of a real object(e.g., an object present in the real world which may be, for example,recognized in an image of the real world). Further, there areapplications capable of arranging virtual objects in an AR space to bealigned to the posture of the real object as well as the position of thereal object. For example, Patent Literature 1 (“PTL 1”) has proposed atechnique capable of suppressing degradation in accuracy of positionalignment of a virtual object.

CITATION LIST Patent Literature

[PTL 1] JP2010-134649

SUMMARY Technical Problem

However, it is difficult to consistently recognize the position or theposture of a real object in the real word projected in an imageaccurately. That is, when an attempt to match the position or theposture of the virtual object with the real object of the real world ismade, the virtual object may be possibly unsteadily displayed due to atemporal degradation in accuracy of recognition. Particularly, whenunstable recognition is made, it is desirable to appropriately controlthe display of the virtual object and to thereby prevent the user frombeing confused due to the unsteady display of the virtual object.

Solution to Problem

As described above, according to the present disclosure, a user can beprevented from being confused due to the unsteady display of a virtualobject. Disclosed embodiments include an apparatus comprising arecognition unit configured to recognize real object in an image. Theapparatus may further comprise a determining unit configured todetermine a stability indicator indicating a stability of therecognition, and a display control unit configured to modify a displayof a virtual object according to the stability indicator.

Disclosed embodiments further include a method comprising recognizingreal object in an image. The method may further comprise determining astability indicator indicating a stability of the recognition, andmodifying a display of a virtual object, according to the stabilityindicator.

Disclosed embodiments further include a tangibly embodied non-transitorycomputer-readable medium storing instructions which, when executed by aprocessor, perform a method comprising recognizing real object in animage. The method may further comprise determining a stability indicatorindicating a stability of the recognition, and modifying a display of avirtual object, according to the stability indicator.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram for explaining an overview of an image processingapparatus according to an embodiment;

FIG. 2 is a diagram for explaining an arrangement of a virtual object.

FIG. 3 is a diagram for explaining a problem related to an arrangementof a virtual object;

FIG. 4 is a block diagram illustrating an example of a hardwareconfiguration of an image processing apparatus according to anembodiment;

FIG. 5 is a block diagram illustrating an example of a configuration oflogical functions of an image processing apparatus according to anembodiment;

FIG. 6 is a diagram for explaining a state quantity of a recognized realobject in an embodiment;

FIG. 7 is a diagram for explaining a first example of a determinationtechnique for determining stability of recognition;

FIG. 8 is a diagram for explaining a second example of a determinationtechnique for determining stability of recognition;

FIG. 9 is a diagram for explaining a third example of a determinationtechnique for determining stability of recognition;

FIG. 10 is a diagram for explaining a fourth example of a determinationtechnique for determining stability of recognition.

FIG. 11 is a diagram for explaining a fifth example of a determinationtechnique for determining stability of recognition;

FIG. 12 is a diagram for explaining an example of control of a trackingpace of a virtual object for tracking a real object;

FIG. 13 is a diagram for explaining control of sharpness of a virtualobject;

FIG. 14 is a diagram for explaining control of transparency of a virtualobject;

FIG. 15 is a diagram for explaining an example of a display prompting auser to take an action for increasing stability; and

FIG. 16 is a flowchart illustrating an example of the flow of a displaycontrol process according to an embodiment.

DESCRIPTION OF EMBODIMENTS

Hereinafter, embodiments of the present disclosure will be described indetail with reference to the appended drawings. Note that, in thisspecification and the appended drawings, structural elements that havesubstantially the same function and structure are denoted with the samereference numerals, and repeated explanation of these structuralelements is omitted.

A description will be made in the following order.

1. Overview

2. Hardware Configuration Example of Image Processing ApparatusAccording to Embodiment

3. Functional Configuration Example of Image Processing ApparatusAccording to Embodiment

3-1. Image Acquiring Unit

3-2. Image Recognizing Unit

3-3. Detecting Unit

3-4. Determining Unit

3-5. Data Acquiring Unit

3-6. Display Control Unit

4. Flow of Display Control Process According to Embodiment

5. Summary

1. Overview

FIG. 1 is a diagram for explaining an overview of an embodiment of atechnique disclosed in this disclosure. Referring to FIG. 1, an imageprocessing apparatus 100 which a user carries in a real space 1 isshown.

The image processing apparatus 100 is an apparatus that provides theuser with an AR application. For example, the image processing apparatus100 may be, for example, a terminal apparatus such as a smartphone, apersonal digital assistant (PDA), a game machine, or a portable musicplayer or may be an information processing apparatus such as a personalcomputer (PC) or a workstation.

The real space 1 is an example of a space where an AR application can beused. Real objects 12 a, 12 b, 12 c, 12 d, and 12 e are present in thereal space 1. It is to be understood that real space 1 and real objects12 a, 12 b, 12 c, 12 d, and 12 e are merely exemplary. Although each isendowed with forms and features for the purposes of illustration, suchfeatures are arbitrary and not meant to be limiting. The real object 12a is a vehicle. The real object 12 b is a guide plate. The real object12 c is an advertising signboard. The real objects 12 d and 12 e arebuildings. The image processing apparatus 100 acquires an image obtainedby imaging a real space where various real objects can be present as aninput image. Then, the image processing apparatus 100 superimposesvirtual objects for AR on the input image based on a recognition resultof the real objects projected in the input image. Information presentedto the user through the virtual objects may be arbitrary informationsuch as navigation information, advertising information, storeinformation, news, or weather forecast information. In the presentembodiment, the AR application provided by the image processingapparatus 100 arranges a virtual object related to a real object in areal space in an image according to the position or the posture of thereal object.

FIG. 2 is a diagram for explaining an arrangement of a virtual object.An image Im01 shown on the left of FIG. 2 is an example of the inputimage acquired by the image processing apparatus 100. The real objects12 a and 12 b are projected in the input image Im01. An output imageIm02 generated such that virtual objects are superimposed on the inputimage Im01 is shown on the right of FIG. 2. In the output image Im02, avirtual object 13 a is superimposed on the real object 12 a, and virtualobjects 13 b and 14 b are superimposed on the real object 12 b. Thevirtual objects 13 a and 13 b function as indications (indicators)representing a result of recognizing a real object through the imageprocessing apparatus 100. Meanwhile, the virtual object 14 b functionsas information content representing information (advertising informationof a product in the example of FIG. 2) related to the real object 12 b.In the present disclosure, the virtual object may include both theindication and the information content. Alternatively, the virtualobject may include one of the indication or the information content.

In the example of FIG. 2, the virtual object 13 a is arranged accordingto a status of the real object 12 a. The virtual objects 13 b and 14 bare arranged according to a status of the real object 12 b. Howaccurately the virtual object is arranged depends on the accuracy ofrecognizing the status of the real object projected in the image.

FIG. 3 is a diagram for explaining a problem related to an arrangementof a virtual object. Referring to FIG. 3, output images Im02-1, Im02-2,Im02-3 of the AR application, which can be displayed by an existingtechnique, at points in time of a time t=n, a time t=n+1, and a timet=n+2, respectively, are shown. In the output image Im02-1 at the timet=n, an arrangement of the virtual object 13 a matches a status of thereal object 12 a. Similarly, arrangements of the virtual objects 13 band 14 b may match a status of the real object 12 b. However, in theoutput image Im02-2 at the time t=n+1, an arrangement of the virtualobject 13 a has deviated from a status of the real object 12 a. Further,in the output image Im02-3 at the time t=n+2, arrangements of virtualobjects 13 b and 14 b are deviated from a status of the real object 12b. Such deviation may be caused, for example, by the degradation in theaccuracy of recognizing the status of the real object. For example, thedegradation in the recognition accuracy may be suddenly caused byvarious events such as a partial deficit of a real object in an inputimage, blurring of an image, the presence of a plurality of objectshaving a similar external appearance, or a variation in a lightingcondition. Unless an environment is particularly prepared, it may bedifficult to completely exclude causes causing the degradation in therecognition accuracy.

A sudden deviation in the arrangement of the virtual object illustratedin FIG. 3 may not only provide the user with an uncomfortable feelingbut may also bring an undesired result such as the degradation invisibility of information represented by the virtual object or a badimpression of an advertising target product. For example, there is atechnique of suppressing an abrupt change in the position or the postureof a virtual object by applying a low pass filter or a moving averagefilter. However, when the filters are applied, tracking of a virtualobject tracking a recognized real object may be delayed. In this regard,the image processing apparatus 100 according to the present embodiment,which will be described in detail in the next section, determines howstable the recognition accuracy is, and controls a display of a virtualobject based on the determination result.

2. Hardware Configuration Example of Image Processing ApparatusAccording to Embodiment

2-1. Hardware Configuration

FIG. 4 is a block diagram illustrating an example of a hardwareconfiguration of the image processing apparatus 100 according to thepresent embodiment. Referring to FIG. 4, the image processing apparatus100 includes an imaging unit 102, a sensor unit 104, an input unit 106,a storage unit 108, a display unit 112, a communication unit 114, a bus118, and a control unit 120.

Imaging Unit

The imaging unit 102 may include a camera module for imaging an image.The imaging unit 102 may image a real space using an imaging device suchas a charge coupled device (CCD) or a complementary metal oxidesemiconductor (CMOS), and generates an imaged image. The imaged imagegenerated by the imaging unit 102 may configure each frame of a videoinput.

Sensor Unit

The sensor unit 104 may include a group of motion sensors that measure amotion of the imaging unit 102. Examples of the motion sensor includesan acceleration sensor that measures an acceleration of the imaging unit102, a gyro sensor that measures a gradient angle, and a geomagneticsensor that measures a direction toward which the imaging unit 102 isdirected. Further, the sensor unit 104 may include a global positioningsystem (GPS) sensor that receives a GPS signal and measures thelatitude, the longitude, and the altitude of an apparatus.

Input Unit

The input unit 106 is an input device that allows the user to operatethe image processing apparatus 100 or to input information to the imageprocessing apparatus 100. The input unit 106 typically includes a touchsensor that detects a touch which the user makes on a screen of thedisplay unit 112. Instead of (or in addition to) the touch sensor, theinput unit 106 may include a pointing device such as a mouse or a touchpad, a gesture recognizing module that recognizes a user's gestureprojected in an image, or a line-of-sight detecting module that includesa head mounted display (HMD) and detects a direction of a user's line ofsight. In addition, the input unit 106 may include any other kind ofinput device such as a keyboard, a keypad, a button, or a switch.

Storage Unit

The storage unit 108 is configured with a storage medium such as asemiconductor memory or a hard disk, and stores a program and data usedfor performing processing by the image processing apparatus 100.Examples of data stored in the storage unit 108 include image datagenerated by the imaging unit 102 and sensor data generated by thesensor unit 104. In addition, for example, model data used when theimage processing apparatus 100 recognizes a real object and object datafor defining a virtual object may be included as data stored in thestorage unit 108.

Display Unit

The display unit 112 may include a display module configured with aliquid crystal display (LCD), an organic light-emitting diode (OLED), acathode ray tube (CRT), or the like. For example, the display unit 112displays an image imaged by the imaging unit 102 or an image of an ARapplication realized by the control unit 120 on a screen. The displayunit 112 may be a screen of the image processing apparatus 100 grippedby the user or an HMD of a through type or a non-through type mounted bythe user.

Communication Unit

The communication unit 114 may include a communication interface thatmediates communication between the image processing apparatus 100 andanother apparatus. The communication unit 114 may support an arbitrarywireless communication protocol or an arbitrary wire-line communicationprotocol, and establishes a communication connection with anotherapparatus.

Bus

The bus 118 connects the imaging unit 102, the sensor unit 104, theinput unit 106, the storage unit 108, the display unit 112, thecommunication unit 114, and the control unit 120 to one another.

Control Unit

The control unit 120 may include a processor such as a centralprocessing unit (CPU) or a digital signal processor (DSP). The controlunit 120 implements various functions of the image processing apparatus100, which will be described later, by executing a program stored in thestorage unit 108 or another storage medium. In particular, the controlunit 120 may include or be connected to a display control unit for,among other things, controlling display unit 112.

3. Functional Configuration Example of Image Processing ApparatusAccording to Embodiment

FIG. 5 is a block diagram illustrating an example of a configuration oflogical functions implemented by the storage unit 108 and the controlunit 120 of the image processing apparatus 100 illustrated in FIG. 4.Referring to FIG. 5, the image processing apparatus 100 includes animage acquiring unit 130, a model database (DB) 138, an imagerecognizing unit 140, a detecting unit 150, a determining unit 160, adata acquiring unit 170, an object DB 172, and a display control unit190.

3-1. Image Acquiring Unit

The image acquiring unit 130 acquires an image obtained by capturing areal space as an input image. For example, the image acquiring unit 130may sequentially acquire each frame of a video input from the imagingunit 102 as the input image. The image acquiring unit 130 may acquireeach frame of a video input, which has been imaged in the past, storedin the storage unit 108 as the input image. The image acquiring unit 130outputs the acquired input image to the image recognizing unit 140.

3-2. Image Recognizing Unit

The image recognizing unit 140 (or “recognizing unit”) may recognize areal object using data stored in the model DB 138. In the model DB 138,model data related to shapes or external appearances of real objectswhich the image processing apparatus 100 regards as recognition targetsis accumulated in advance. In the present embodiment, the targetrecognized by the image processing apparatus 100 may be an arbitraryobject present in a real space such as the real objects 12 a to 12 eillustrated in FIG. 1. Examples of the model data include data definingthe shape of each real object, image data such as a predetermined symbolmark or a text label attached to each real object, and data of a featurequantity set extracted from a known image of each real object.

The image recognizing unit 140 may recognize a real object projected inthe input image input from the image acquiring unit 130 and recognizes astatus of the real object projected in the input image. A status of thereal object recognized by the image recognizing unit 140 may include atleast one of the position and the posture of the real object. In thepresent embodiment, the image recognizing unit 140 recognizes theposition, the posture, and the scale of the real object in the image.For example, the image recognizing unit 140 may collate a set of featurepoints extracted from the input image with the shape of the real objectwhich can be defined by the model data. The image recognizing unit 140may collate the image data such as the symbol mark or the text labelwhich can be defined by the model data with the input image. Inaddition, the image recognizing unit 140 may collate a known featurequantity of an image of the real object which can be defined by themodel data with the feature quantity extracted from the input image.Even in any of the above cases, the image recognizing unit 140 canrecognize that a real object whose collation score is higher than apredetermined threshold value is being projected in the input image atthe position, the posture, and the scale corresponding to the collationscore.

FIG. 6 is a diagram for explaining a state quantity of a real objectrecognized by the image recognizing unit 140 in the present embodiment.Referring to FIG. 6, a global coordinate system formed by an x axis anda y axis on a plane vertical to an optical axis direction of an imagingapparatus that has imaged the input image and a z axis parallel to theoptical axis direction is shown. A position P of a real object 12recognized by the image recognizing unit 140 is given as positioncoordinates of a reference point of the real object in the globalcoordinate system. A posture W of the real object 12 is given as arotational angle of each axis periphery. A scale Sc of the real object12 is given as a magnification for each axis direction. In the presentdisclosure, for simplicity of description, the position P, the postureW, and the scale Sc of each real object are described as discretevariables (state quantities). However, the position P, the posture W,and the scale Sc of each real object may be expressed, in an integratedmanner, by, for example, one 4 by 4 homogeneous transformation matrixrepresenting transformation between the global coordinate system and amodel coordinate system of each real object. In this case, each statequantity (that is, each of the position P, the posture W, and the scaleSc) is extracted from one homogeneous transformation matrix asnecessary, and then used. The global coordinate system may be acoordinate system representing relative coordinates using the positionof the imaging apparatus as an original point. Alternatively, the globalcoordinate system may be a coordinate system representing absolutecoordinates fixedly defined in a real space. The image recognizing unit140 outputs an identifier, the position P, the posture W, and the scaleSc of each real object projected in the input image to the determiningunit 160, the data acquiring unit 170 and the display control unit 190.

3-3. Detecting Unit

The detecting unit 150 detects a user input, and outputs the user inputto the display control unit 190. Examples of the user input detected bythe detecting unit 150 include a touch input to a touch sensor of theinput unit 106, pressing of a button, a motion of an apparatus measuredby a sensor group of the sensor unit 104 (for example, a gradient,shaking, or the like), a user's gesture, and a user's voice.

3-4. Determining Unit

The determining unit 160 may determine stability of recognition whichthe image recognizing unit 140 makes on the status of the real objectprojected in the input image. The stability determined by thedetermining unit 160 may be represented by a continuous value or adiscrete value (for example, a stepwise value having three or more stepsor two values of “high” and “low”). Five examples of a method ofdetermining stability by the determining unit 160 will be describedbelow with reference to FIGS. 7 and 11.

(1) First Example

For example, when recognition made by the image recognizing unit 140 isunstable, a change in the state quantity of the real object is likely tobe drastic. Thus, the determining unit 160 may determine the stabilitybased on a temporal change in the state quantity of the real objectrecognized by the image recognizing unit 140. For example, the temporalchange in the state quantity of the real object can be detected bycomparing a frequency characteristic of the state quantity of the realobject, a differential (or a second order differential) of the statequantity, or a history of the state quantity within a predetermined timeperiod with a current value of the state quantity.

FIG. 7 illustrates an example of the temporal change in a state quantityof the real object. In FIG. 7, a horizontal axis denotes a time t, and avertical axis denotes a state quantity X and a first order differential(dX/dt) thereof. The state quantity may be any one of the position P,the posture W, and the scale Sc on a certain real object or acombination of two or more thereof.

In the example of FIG. 7, the state quantity X gently changes during atime period between a time T0 and a time T1. In this case, for example,when a frequency component of the state quantity X is extracted by theFourier transform, a high frequency component of the state quantity Xdecreases. An absolute value of the first order differential of thestate quantity X does not exceed a certain level (a level correspondingto a realistic limit at which the real object moves relative to theimage apparatus). This is similarly applied to the second orderdifferential of the state quantity X. Further, since the change in thestate quantity X is gentle, a current value of the state quantity X doesnot significantly differ from a history of previous m samples of thestate quantity X. The history of the state quantity X may be a simpleaverage value of the previous m samples or a weighted average value ofthe previous m samples (e.g., the more recent the sample is, the largera set weight).

On the other hand, the state quantity X drastically changes during atime period between the time T1 and a time T2. In this case, when thefrequency component of the state quantity X is extracted, the highfrequency component of the state quantity X further increases. Further,the first order differential of the state quantity X oscillates aroundzero. The absolute value of the first order differential is higher thanthe predetermined level. In addition, a current value of the statequantity X may significantly differ from the history of the previous msamples of the state quantity X.

Thus, in a first determination technique, the determining unit 160evaluates a temporal change in the state quantity by comparing one ofthe magnitude of the high frequency component of the state quantity ofthe real object, an absolute value (an integrated value) of the firstorder differential (or the second order differential) of the statequantity, and the history of the state quantity with the current valueof the state quantity. Then, the determining unit 160 determines thatthe more drastic the evaluated temporal change is, the lower thestability of recognition is. In the example of FIG. 7, it is determinedthat the stability of the time period between T0 and T1 is “high,” thestability of the time period between T1 and T2 is “low,” the stabilityof the time period between T2 and T3 is “medium,” and the stability ofthe time period between T3 and T4 is “low.”

(2) Second Example

When recognition made by the image recognizing unit 140 is unstable, achange in the state of the real object, which may occur rarely in arealistic situation, is likely to be shown in a recognition result. Forexample, when a real object, which remains stationary in a real space,is imaged, the position or the scale of the real object in the image maychange depending on a change in an angle of view. However, in this case,it is rare for the real object to significantly rotate at the same time.In this regard, in a second determination technique, the determiningunit 160 may determine the stability based on consistency between aplurality of state quantities of the real object recognized by the imagerecognizing unit 140.

Referring to FIG. 8, a recognition result, obtained by the imagerecognizing unit 140 during a time period between a time T1 and a timeT2, represents that a real object 12 is moving and the posture of thereal object 12 is rotating. Movement of the real object 12 during thetime period between the time T1 and the time T2 may be represented by adifference dP in the position P, and rotation may be represented by adifference dW in the posture W. Thus, for example, when the movement dPis larger than a first threshold value and the rotation dW is largerthan a second threshold value, since the change in the status of thereal object is unnatural, the determining unit 160 can determine thatthe stability of recognition made by the image recognizing unit 140 isdecreasing.

(3) Third Example

In a third determination technique, the determining unit 160 determinesthe stability based on consistency between the change in the statequantity of the real object and a motion of the imaging unit 102measured by a motion sensor.

Referring to FIG. 9, the user is moving the image processing apparatus100 so that an angle of view of the imaging unit 102 can change during atime period between a time T1 and a time T2. As a result, the realobject 12 b projected on the lower right of an input image Im11-1 isbeing moved to the upper left of an input image Im11-2. A change in anangle of view of the imaging unit 102 can be recognized based on sensordata from a motion sensor (for example, an acceleration sensor, a gyrosensor, or a geomagnetic sensor) of the sensor unit 104. In this case,when the movement dP or the rotation dW of the real object 12 b is notconsistent with a motion of the image processing apparatus 100recognized based on the sensor data, the determining unit 160 candetermine that the stability of recognition made by the imagerecognizing unit 140 is decreasing.

(4) Fourth Example

In a fourth determination technique, the determining unit 160 determinesthe stability based on consistency between a change in a state quantityof a real object recognized by the image recognizing unit 140 and adirection of a motion of a real space shown in a plurality of inputimages. For example, a direction of a motion of a real space shown in aplurality of input images may be recognized based on an optical flowobtained from a plurality of input images.

Referring to FIG. 10, an optical flow F from an input image Im12-1imaged at a time T1 to an input image Im12-2 imaged at a time T2 isshown. The optical flow F represents a motion of a real object which ismoved toward the upper left of the image. On the other hand, arecognition result obtained by the image recognizing unit 140 representsthat the real object 12 b is rotating while moving upward. As describedabove, when the movement dP or the rotation dW of the real object 12 bis not consistent with the optical flow formed by a plurality of inputimages, the determining unit 160 can determine that the stability ofrecognition made by the image recognizing unit 140 is decreasing.Further, the stability of the real object may be calculated as thecontinuous value, based on a correlation (for example, a scalar product)between a motion vector of the real object predicted from the opticalflow around the real object 12 b and a motion vector of the real objectrecognized as a result of recognizing the real object (for example, thelarger the scalar product is, the more stable the recognition is).

(5) Fifth Example

For example, when a part of a real object is not projected in an inputimage, a probability that the accuracy of recognition will decrease ishigh. In this regard, in a fifth determination technique, when a part ofa recognized real object is not projected in an input image, thedetermining unit 160 determines that the stability of recognition islower than when the whole real object is projected in the input image.For example, a part of a recognized real object may not be projected inan input image when a part of the real object is positioned outside anangle of view, or when a part of the real object is hidden by anotherreal object (in case of so-called occlusion).

Referring to FIG. 11, the real objects 12 a and 12 b are projected in aninput image Im13. A part of the real object 12 a is hidden by the realobject 12 b. A part of the real object 12 b is out of an angle of view.A plurality of triangular symbols illustrated in FIG. 11 represent thepositions of feature points which can be used when the image recognizingunit 140 recognizes the real object 12 a. For example, the determiningunit 160 calculates a ratio of the number of feature points of the realobject 12 a recognized by the image recognizing unit 140 to the totalnumber of feature points thereof. When the calculated ratio is lowerthan a predetermined threshold value, the determining unit 160 candetermine that a part of the real object 12 is not projected in theinput image. Further, for example, when the center of the real object isoutside a stable detection region of an image, the determining unit 160can easily determine that there is a high possibility that a part of thereal object is not projected in the input image. In the example of FIG.11, a center Pa of the real object 12 a is located inside a stabledetection region R, whereas a center Pb of the real object 12 b islocated outside the stable detection region R. The stable detectionregion R can be set to the center of the input image so that a boundarythereof can have a predetermined offset from the edge of the inputimage.

The determining unit 160 determines the stability of recognition made bythe image recognizing unit 140 using the above described techniques, andoutputs the stability of each real object to the display control unit190. The image processing apparatus 100 may allow the determining unit160 to select two or more of the above described techniques, and mayadaptively change a technique to be used by the determining unit 160according to a condition. For example, when stability is determined on adynamic real object, the first technique may be selected, whereas whenstability is determined on a stationary real object, any one of thesecond to fourth techniques may be selected. Further, the fifthtechnique may be selected in a scene in which a small processing cost isdesirable.

3-5. Data Acquiring Unit

The data acquiring unit 170 acquires data related to the virtual objectto be superimposed on the input image through the display control unit190. The data acquired by the data acquiring unit 170 includes objectdata defining the virtual object. Examples of the object data includethe type of virtual object, an identifier of an associated real object,a relative display position from the real object, and data defining thecontent of the information content. The data acquiring unit 170 mayacquire object data which is previously stored in the object DB 172.Alternatively, the data acquiring unit 170 may acquire newest objectdata from a data server installed in the real space via thecommunication unit 114. For example, the object data provided from thedata server may be data which differs according to the position of theimage processing apparatus 100 measured by the sensor unit 104. The dataacquiring unit 170 outputs the object data to the display control unit190.

3-6. Display Control Unit

The display control unit 190 may control a display of the virtual objectby the AR application provided by the image processing apparatus 100.More specifically, for example, the object data of the virtual objectassociated with the real object recognized as being projected in theinput image by the image recognizing unit 140 is input from the dataacquiring unit 170 to the display control unit 190. The display controlunit 190 generates an object image of the virtual object based on theobject data. The object image generated by the display control unit 190is typically an image obtained by projecting the virtual objectvirtually arranged in a three-dimensional real space on an imagingsurface of the image processing apparatus 100. Then, the display controlunit 190 outputs the generated output image to the display unit 112. Asa result, an image of the AR application is presented to the user.

An arrangement of a virtual object is decided according to a status ofan associated real object. That is, the display control unit 190 causesthe virtual object to be displayed on a display device so that thevirtual object can track the position, the posture, and the scale ofeach real object.

In the present embodiment, the display control unit 190 changes adisplay of the virtual object associated with each recognized realobject according to the stability of recognition made by the imagerecognizing unit 140. For example, the display control unit 190 changesa tracking pace of the virtual object for tracking each real objectaccording to the stability. Typically, the display control unit 190increases the tracking pace of the virtual object for tracking each realobject when the stability is high, but decreases the tracking pace ofthe virtual object for tracking each real object when the stability islow. For example, the tracking pace may be represented by aninterpolation coefficient k in Formula (1):

Z _(n+1) =Z _(n) +k(X _(n+1) −Z _(n))  [Math. 1]

In Formula (1), Z_(n) and Z_(n+1) represent a status of a virtual objectat a time t=n and a status of a virtual object at a time t=n+1, andX_(n+1) represents a status of a recognized real object at a time t=n+1.The display control unit 190 controls a value of the interpolationcoefficient k in a range which is larger than zero but equal to or lessthan 1. For example, when k is 1, Z_(n+1)=X_(n+1), and a status of thevirtual object at a time t=n+1 reflects a status of the recognized realobject as is. However, when k is 0.1, Z_(n+1)=0.1X_(n+1)+0.9X_(n), astatus of the virtual object at a time t=n+1 partially reflects a statusof the recognized real object.

FIG. 12 is a diagram for explaining an example of control of a trackingpace of a virtual object for tracking a real object. Statuses of thevirtual object 13 a at three points in time of times t=n, n+1, and n+2when the tracking pace is constant (when the interpolation coefficient kis constant) are shown in the upper section of FIG. 12. The virtualobject 13 a is a virtual object related to the real object 12 a. Atpoints in time of times t=n+1 and n+2, since the stability ofrecognition is low, the status of the virtual object 13 a issignificantly deviated from the real object 12 a.

Meanwhile, statuses of the virtual object 13 a at three points in timeof times t=n, n+1, and n+2 when the tracking pace is dynamicallycontrolled are shown in the lower section of FIG. 12. In this case, atthe point in time of the time n=t, since the stability is high, thetracking pace is fast, and the status of the real object 12 a is almostconsistent with the status of the virtual object 13 a. Thereafter, atthe points in time of the times t=n+1 and n+2, when the stabilitydecreases, the value of the interpolation coefficient k decreases, andthus the status of the virtual object 13 a gently tracks the status ofthe real object 12 a. As a result, the deviation of the virtual object13 a from the status of the real object 12 a is outwardly suppressed tobe reduced.

The display control unit 190 may change a display attribute of thevirtual object according to the stability instead of (or while) changingthe tracking pace of the virtual object for tracking each real objectaccording to the stability. The display attribute of the virtual objectmay include a static attribute and a dynamic attribute related to adisplay mode of the virtual object. Examples of the static attributerelated to a display mode of the virtual object include transparency,sharpness, color, scale, shape, the number, and an afterimage pattern ofthe virtual object. Examples of a dynamic attribute related to a displaymode of the virtual object include a blinking pattern of the virtualobject, a blinking cycle, and the type of animation. By changing thedisplay attribute of the virtual object, it is possible to cause theuser to intuitively understand the fact that the stability ofrecognition of the real object is decreasing.

As an example, referring to FIG. 13, sharpness of the virtual object 13b is set to high in an input image Im22-2 a whose stability ofrecognition of the real object is determined to be high. As a result,the contour of the virtual object 13 b of the input image Im22-2 a isclearly displayed. However, sharpness of the virtual object 13 b is setto low in an input image Im22-2 b whose stability of recognition of thereal object is determined to be low. As a result, the contour of thevirtual object 13 b of the input image Im22-2 b is dimly displayed.

As another example, referring to FIG. 14, transparency of the virtualobject 13 b is set to low in an input image Im23-2 a whose stability ofrecognition of the real object is determined to be high. As a result,the contour of the virtual object 13 b of the input image Im23-2 a isnot transparent. However, transparency of the virtual object 13 b is setto high in an input image Im23-2 b whose stability of recognition of thereal object is determined to be low. As a result, the contour of thevirtual object 13 b of the input image Im23-2 b is transparent.

In the example of FIG. 14, the transparency of the virtual object 13 b,which functions as the indication for representing a recognition resultof the real object, changes according to the stability of recognition,whereas the transparency of the virtual object 14 b, which functions asthe information content, is maintained constant. As described above, thedisplay control unit 190 may change a display of only one of theindication and the information content. By changing a display of onlythe indication, when the stability of recognition of the real object hasdecreased, it is possible to cause the user to understand thedegradation in the stability without decreasing visibility of theinformation content. On the other hand, by changing a display of onlythe information content, when the stability of recognition of the realobject has decreased, it may be possible to prevent an impression of theinformation content from getting worse while informing the user of howthe real object has been recognized.

When the stability of recognition made by the image recognizing unit 140is low, the display control unit 190 may further cause informationprompting the user to take an action for increasing the stability to bedisplayed on the display device. In this case, for example, theinformation to display may be a message (see a message 15 of an imageIm30 of FIG. 15) prompting movement of an apparatus so that the realobject shown outside an angle of view can be located within an angle ofview. The user who has referred to the information takes an action forincreasing the stability, and thus the unsteady display of the virtualobject can be avoided.

In addition, when a predetermined user input is detected by thedetecting unit 150, the display control unit 190 may cause the displaydevice to perform a display of the virtual object under the assumptionthat the stability is high, regardless of the stability of recognitiondetermined by the determining unit 160. Examples of the predetermineduser input include an input to a touch sensor, an input sensed by amotion sensor, pressing down of a button provided on the imageprocessing apparatus 100, and an arbitrary user input such as the user'sgesture or voice. As a result, when the user allows the unsteady displayof the virtual object, display control according to the stability can betemporarily stopped.

4. Flow of Display Control Process According to Embodiment

FIG. 16 is a flowchart illustrating an example of the flow of a displaycontrol process performed by the image processing apparatus 100according to the present embodiment.

Referring to FIG. 16, first, the image acquiring unit 130 acquires animage projected in a real space as an input image (step S102). Next, theimage recognizing unit 140 may recognize a real object projected in eachinput image input from the image acquiring unit 130 and recognizes astatus of each real object projected in the input image (step S104). Forexample, the status of the recognized real object may include theposition, the posture, and the scale of the real object. Next, thedetermining unit 160 determines whether or not there is a real objectrecognized as being projected in the input image by the imagerecognizing unit 140 (step S106). Here, when there is no recognized realobject, the process returns to step S102. However, when there is arecognized real object, the process proceeds to step S108.

In step S108, the determining unit 160 may determine a stability ofrecognition which the image recognizing unit 140 has made on the statusof the real object projected in the input image. Next, the dataacquiring unit 170 acquires object data of a virtual object related toeach real object recognized by the image recognizing unit 140 (stepS110). Further, the display control unit 190 decides a display attributeof the virtual object related to each recognized real object accordingto the stability of recognition determined by the determining unit 160(step S112). Here, the display attribute may be the interpolationcoefficient k, or transparency, sharpness, a blinking pattern, ablinking cycle, or the scale of an afterimage of the virtual object.Next, the display control unit 190 generates an image of the virtualobject related to each real object according to the decided displayattribute (step S114). Then, the display control unit 190 superimposesthe generated image of the virtual object on the input image (stepS116).

5. Summary

The image processing apparatus 100 according to an embodiment has beendescribed in detail so far with reference to FIGS. 1 to 16. According tothe present embodiment, a display of the virtual object related to thereal object dynamically changes according to the stability ofrecognition made on the status of the real object projected in theimage. As a result, it is possible to prevent the user from beingconfused due to the unsteady display of the virtual object caused by thedegradation in the accuracy of recognition.

Further, according to the present embodiment, an application, whichallows the virtual object to track the position or the posture of thereal object, may control the tracking pace of the virtual object fortracking the real object according to the stability, for example. As aresult, a tracking delay can be kept small while the accuracy ofrecognition is high, and sudden deviation in an arrangement of thevirtual object when the accuracy of recognition has been lowered can beeffectively reduced.

Further, according to the present embodiment, a display attribute of thevirtual object, such as transparency, sharpness, a blinking pattern, ablinking cycle, or the scale of an afterimage of the virtual object, canbe controlled according to the stability. As a result, since the usermay intuitively understand the fact that the stability of recognition ofthe real object is decreasing, it is possible to prevent the user frombeing confused when a display of the virtual object is unsteady.

Further, the stability can be determined based on the temporal change inthe state quantity of the real object. In this case, the stability canbe accurately determined, regardless of whether a target real object isa dynamic real object or a stationary real object, without needing anauxiliary input by a motion sensor or the like. Further, the stabilitymay be determined based on consistency between the recognized statequantities, consistency between the state quantity and a motion of anapparatus, or consistency between the state quantity and a direction ofa motion of a real space shown in the input image. In this case, thestability can be determined without needing a calculation such asanalysis or a differential of a frequency of the state quantity of thereal object. Further, the stability can be easily determined accordingto whether or not the real object has a part which is not projected inthe input image.

The processing by the image processing apparatus 100 described in thepresent disclosure may be implemented using software, hardware, or acombination of software and hardware. For example, a program configuringsoftware is previously stored in a storage medium provided inside oroutside each apparatus. For example, each program is read to a randomaccess memory (RAM) at the time of execution and executed by a processorsuch as a CPU.

The present embodiment has been described in connection with the examplein which an image is mainly displayed on a screen of the display unit112 of the image processing apparatus 100. However, as anotherembodiment, an image processing apparatus that receives an input imagefrom a terminal apparatus carried by a user may recognize the positionor the posture of a real object projected in the input image and controla display of a virtual object related to the real object according to astability of recognition. In this case, an image for displaying thevirtual object may be transmitted from the image processing apparatus tothe terminal apparatus and then displayed on a screen of the terminalapparatus.

In the present disclosure, an index for identifying the degradation inthe accuracy of recognition has been described using the term“stability,” however, an index having the substantially same technicalmeaning may be represented by another term such as “instability.”

Although preferred embodiments of the present disclosure are describedabove with reference to the appended drawings, the present disclosure isnot limited thereto. It should be understood by those skilled in the artthat various modifications, combinations, sub-combinations andalterations may occur depending on design requirements and other factorsinsofar as they are within the scope of the appended claims or theequivalents thereof.

Note that the following configurations are within the scope of thepresent disclosure.

(1) An image processing apparatus, comprising:

a recognizing unit configured to recognize a position or a posture of areal object projected in an image; anda display control unit configured to change a display of a virtualobject related to the real object according to stability of recognitionmade by the recognizing unit.

(2) The image processing apparatus according to (1), wherein the displaycontrol unit displays the virtual object on a display device for thevirtual object to track a position or a posture of the real object.

(3) The image processing apparatus according to (2), wherein the displaycontrol unit changes a tracking pace of the virtual object for the realobject according to the stability.

(4) The image processing apparatus according to (3), wherein the displaycontrol unit set the tracking pace of the virtual object for the realobject faster when the stability is higher.

(5) The image processing apparatus according to any one of (1) to (4),wherein the display control unit changes either or both of a staticattribute and a dynamic attribute related to a display mode of thevirtual object according to the stability.

(6) The image processing apparatus according to any one of (1) to (5),further comprising a determining unit configured to determine thestability.

(7) The image processing apparatus according to (6), wherein thedetermining unit determines the stability based on a temporal change ina state quantity of the real object recognized by the recognizing unit,and the state quantity of the real object relates to at least one of aposition, a posture, and a scale of the real object.

(8) The image processing apparatus according to (6), wherein

the determining unit determines the stability based on consistencybetween a first state quantity and a second state quantity of the realobject recognized by the recognizing unit, andeach of the first state quantity and the second state quantity of thereal object relates to at least one of a position, a posture, and ascale of the real object.

(9) The image processing apparatus according to (6), wherein, when thereal object recognized by the recognizing unit includes a part which isnot projected in the image, the determining unit determines that thestability is lower than when the whole real object is projected in theimage.

(10) The image processing apparatus according to (6), furthercomprising:

an imaging unit configured to capture the image; anda motion sensor configured to measure a motion of the imaging unit,wherein the determining unit determines the stability based onconsistency between a change in a state quantity of the real objectrecognized by the recognizing unit and a motion of the imaging unitmeasured by the motion sensor, andthe state quantity of the real object relates to at least one of aposition, a posture, and a scale of the real object.

(11) The image processing apparatus according to (6), wherein thedetermining unit determines the stability based on consistency between achange in a state quantity of the real object recognized by therecognizing unit and a direction of a motion of a real space shown in aplurality of input images, and

the state quantity of the real object relates to at least one of aposition, a posture, and a scale of the real object.

(12) The image processing apparatus according to any one of (1) to (11),wherein, when the stability is low, the display control unit displaysinformation prompting a user to take an action for increasing thestability.

(13) The image processing apparatus according to any one of (1) to (12),wherein, when a predetermined user input is detected, the displaycontrol unit causes a display device to perform a display of the virtualobject under the assumption that the stability is high regardless of thestability of recognition made by the recognizing unit.

(14) The image processing apparatus according to any one of (1) to (13),wherein the virtual object is an indication representing that the realobject has been recognized.

(15) The image processing apparatus according to any one of (1) to (13),wherein the virtual object is information content related to therecognized real object.

(16) The image processing apparatus according to any one of (1) to (13),wherein the display control unit causes an indication representing thatthe real object has been recognized and information content related tothe recognized real object to be displayed on a display device as thevirtual objects, and changes a display of either of the indication andthe information content according to the stability.

(17) A display control method, comprising:

recognizing a position or a posture of a real object projected in animage; andchanging a display of a virtual object related to the real objectaccording to stability of the recognition.

(18) A program causing a computer that controls an image processingapparatus to function as:

a recognizing unit configured to recognize a position or a posture of areal object projected in an image; anda display control unit configured to change a display of a virtualobject related to the real object according to stability of recognitionmade by the recognizing unit.

REFERENCE SIGNS LIST

-   100 Image processing apparatus-   102 Imaging unit-   104 Sensor unit-   140 Image recognizing unit-   160 Determining unit-   190 Display control unit

What is claimed is:
 1. An information processing apparatus, comprising:one or more processors configured to: recognize a real object in animage; compare a current value of a state quantity of the real object inthe image with at least one prior value of the state quantity of thereal object in the image; determine a stability indicator that indicatesa stability of the recognition of the real object based on thecomparison; and control a display of a virtual object based on thestability indicator.
 2. The information processing apparatus accordingto claim 1, wherein the one or more processors are further configured tocontrol the display of the virtual object by modification of at leastone attribute related to the display of the virtual object.
 3. Theinformation processing apparatus according to claim 2, wherein themodified at least one attribute of the virtual object is at least one ofa static attribute or a dynamic attribute.
 4. The information processingapparatus according to claim 3, wherein the dynamic attribute comprisesat least one of an animation related to the display of the virtualobject, a pattern displayed on the display of the virtual object, or atiming of display of the pattern displayed on the display of the virtualobject.
 5. The information processing apparatus according to claim 2,wherein the modified at least one attribute comprises an indication to auser of a change in the stability indicator.
 6. The informationprocessing apparatus according to claim 1, wherein the one or moreprocessors are further configured to control the display of the virtualobject based on a tracked position or orientation of the real object. 7.The information processing apparatus according to claim 6, wherein thetracked position is tracked at a tracking pace, and the one or moreprocessors are further configured to select the tracking pace based onthe stability indicator.
 8. The information processing apparatusaccording to claim 1, wherein the one or more processors are furtherconfigured to determine the stability indicator by: comparison of thecurrent value of a first aspect of the state quantity of the real objectto a prior value of the first aspect of the state quantity; andcomparison of the current value of a second aspect of the state quantitywith a prior value of the second aspect of the state quantity.
 9. Theinformation processing apparatus according to claim 8, wherein the firstaspect and the second aspect of the state quantity include a position ofthe real object and an orientation of the real object, respectively. 10.The information processing apparatus according to claim 1, wherein theone or more processors are further configured to: control to modify thedisplay of the virtual object to include an indicator of the recognitionof the real object; and superimpose an informational display of thevirtual object onto at least one of the image or the modified display ofthe virtual object.
 11. The information processing apparatus accordingto claim 1, wherein the comparison of the current value of the statequantity of the real object with the at least one prior value of thestate quantity of the real object indicates a temporal change in thestate quantity within a determined time period.
 12. A method,comprising: recognizing a real object in an image; comparing a currentvalue of a state quantity of the real object in the image with at leastone prior value of the state quantity of the real object in the image;determining a stability indicator indicating a stability of therecognition of the real object based on the comparison; and controllinga display of a virtual object based on the stability indicator.
 13. Themethod according to claim 12, wherein the comparison of the currentvalue of the state quantity of the real object with the at least oneprior value of the state quantity of the real object indicates atemporal change in the state quantity within a determined time period.14. A non-transitory computer-readable storage medium having storedthereon, computer-executable instructions for causing an informationprocessing apparatus to execute operations, the operations comprising:recognizing a real object in an image; comparing a current value of astate quantity of the real object in the image with at least one priorvalue of the state quantity of the real object in the image; determininga stability indicator indicating a stability of the recognition of thereal object based on the comparison; and controlling a display of avirtual object based on the stability indicator.